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Abstract 



^ Dynamic Epistemic Logic makes it possible to model and reason about information 



change in multi-agent systems. Information change is mathematically modeled through 
epistemic action Kripke models introduced by Baltag et al. Also, van Ditmarsch inter- 
fT^ prets the information change as a relation between epistemic states and sets of epistemic 

CN states and to describe it formally, he considers a special constructor Lb called learning 

^_^ operator. Inspired by this, it seems natural to us that the basic source of information 

/^ change in a multi-agent system should be learning an announcement by some agents 

together, privately, concurrently or even wrongly. Hence moving along this path, we 
introduce the notion of a learning program and prove that all finite KA5 action models 
can be described by our learning programs. 



rr^ 1 Introduction 

> 

f~^ A computable function over strings of a finite alphabet is a function that can be computed 

CN by a Turing machine. A Turing machine takes a string as input, performs a sequence of 

^"i elementary changes on the string and if it halts, it provides another string as output of the 

^^ function. In recursion theory all Turing computable function can be obtained via some initial 

^T) functions: zero, successor, and projections through applying some basic operations such as 

^— I composition, primitive recursion and least search. In this paper, our goal is to develop a 

^ similar methodology for a class of epistemic functions. Following the same terminology, an 

epistemic function is a function that takes the epistemic state of a multi-agent system as 
input and yields a new episteinic state as output. The notion of epistemic function is the 
focus of Dynamic Epistemic Logic, Baltag et al. [HIS], and is formalized in action models. 
These functions act on Kripke models via an update operator and produce an update Kripke 
model. In this paper, we concentrate on those epistemic functions which can be coded as 
K45 action models. K45 models are those models which accessibly relations transitive and 
Euclidian. We claim that there are possible information changes which are not possible to 
encode them by KDAb or 5*5 action models. Consequently K4:5 action models are more 
powerful to describe epistemic functions than KD 4:5 and S5 action models. It is why we 
consider K45 action models instead of 5*5 or KD45 models. 

So far no one has looked at it from a computational aspect to answer the following 
question 

what are the initial functions and the basic operations by which all K45 epis- 
temic functions can be obtained? 



The basic source of information change in a multi-agent system is learning an announce- 
ment by some agents together, privately, concurrently or even wrongly. So the basic operators 
should be different kinds of learning. Van Ditmarsch et al. introduced a learning constructor 
in [71 [8]. We define our own learning operator which is different from van Ditmarsch's. 

As our initial functions, we take the test of any facts (p, that is ?(/?. For the basic 
operations, we take the following different kinds of learning: 1- alternative learning, 2- 
concurrent learning, 3- wrong learning, and finally, 4- recursive learning. Following the 
footsteps of recursion theory, we prove that all epistemic functions can be obtained through 
the test of facts by applying the above four basic operations. 

Epistemic logic, started with Hintikka's groundwork [14j . models and reasons about the 
knowledge of agents in a group |12| . In Epistemic logic, the notions of knowledge and belief 
are modeled in terms of the possible worlds (states). An agent knows or believes a fact if 
it is true in all the worlds that the agent considers possible as alternatives for the actual 
world. 

As information is transmitted, knowledge and belief of agents in a multi-agent system 
may change. The simplest cause for an information change is to announce some truth in 
public. Plaza in 1989 |15j, introduced a logic to formalize these changes and called it the 
public announcement logic. In public announcement logic, a fact is publicly announced 
in a multi-agent system and all agents together update their knowledge and belief. How- 
ever, more complex actions than such as private or dishonest announcements may occur, 
this is whereby different agents may have different views on some action [H El E] , or the 
announcement may not be truthful. 

Our work may be considered as a bridge between two paradigms in dynamic epistemic 
logic, namely Baltag et al. style action model and van Ditmarsch et al. epistemic actions. 

The paper is organized as follows. In section [2| we recall the definition of action models 
from [4j, and we explain what it means that a group of agents learns an action model. 
We discuss K4:5 Kripke models which only have transitive and Euclidian properties. We 
also introduce a new notion called the applicable formulas, and restrict the definition of 
satisfaction relation only to applicable formulas. 

In section [3j we introduce the initial functions and basic operations as the building 
blocks of the finite epistemic functions. 

In section [4j we add the new operator of recursion in constructing the finite epistemic 
functions. 

Finally, in section [Sj we compare our work with other related works, and further works 
may be done. 

2 Backgrounds 

In this section, we introduce two significant notion: 1- learning an action model, 2- K4:5 
Kripke models and applicable formulas. 

2.1 Action models 

We start by recalling the definition of an action model from Baltag [1]. A pointed action 
model {N, sq) is a tuple N = {S, (^a)agA,pre), where 5 is a set of events, ^ is a set of 



agents, ^a is an accessibility relation for agent a on events in 5, and pre is a function that 
assigns to each event, a formula of an appropriate epistemic language, as a precondition for 
that event, so G 5" is called the actual event. 

Announcements of facts in multi-agent systems give rises to information changes, where 
different agents have different access to the resource of the announcement, and also different 
views about the access of other agents to the resource. 

For instance, consider the pointed action model (A^i,s) in Figure 1, where S = {s,t}. 



t, t --a t, pre{s) = ip,pre{t) = ip. 
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Figure 1 

Here {Ni,s) encodes the following information change 

" (f is announced whereas agent a (wrongly) learns ip." 

One may assume that ^ = green and ip = blue, are two different colors. Then the action 
model (A'^i, s) says that a green ball is shown to a, whereas agent a thinks that she sees a 
blue ball. 

Consider the pointed model {N2,s) in Figure 2, where S = {s,t}, s ^^ s, s ^a t, 
t ^a t, pre{s) = if and pre{t) = ip. 
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Figure 2 

This action model encodes the following information change: "a green ball is shown to 
agents a and b, agent b sees a green ball and is aware that agent a has a color-blindness and 
sees a blue ball. Agent a just sees a blue ball and has no idea about what b sees" . 

Remark 2.1 The word ^^having no idea" used above is vague, and needs to be clarified. 
In the action model A^i, agent b is not present in the state t. So agent a has no idea about 
the information of agent b. There could be lots of possibilities about the color-blindness of 
b at state t, but the action model says nothing about it. We will later introduce the notion 



of applicable formulas (see Definition 2.7) to formally model this case. 



For another example, consider the pointed model {N^, s) in Figure 3, where S 
^h s, t ^h s s ^a t, t ^a t, pre{s) = (f and pre{t) = if). 



{s.t}, 
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Figure 3 

The action model (A^3,,s) encodes the information change that "a green ball is shown to 
agents a and b, agent b sees a green ball and is aware that agent a has a color-blindness 
and sees a blue ball. Agent a sees a blue ball and wrongly thinks that agent b has a 
color-blindness and sees a green ball. Moreover, both agents are aware about each other's 
thoughts" . 

According to the above discussion, a pointed action model (N, s), where N = {S, (^a 
)a£A,pre), encodes the information change that 

the fact pre{s) is announced whereas each agent (relevant to its accessibility rela- 
tion in the action model) acquires information of what may have been announced 
and what other agents may have heard. 

2.2 Learning an Action Model 

We should also clarify what it means that a group of agents learns an action model. Suppose 
{N, s) is a pointed action model, and B is a group of agents. The case that 

"group B learns (A^, s)", 

is a new action model which encodes the following information change 

"the fact pre{s) is announced and group B learns the fact pre{s) and the infor- 
mation that other agents (excluding B) acquires due to learning (A^, s)". 

For example, recall the pointed action model (A^i,s) in Figure 1, where it encodes the 
following information change 

"(/9 is announced whereas agent a (wrongly) learns ip" . 

An agent b learns (A^i, s) means 

"(/9 is announced and agent b learns that (p is announced and the fact that agent 
a (wrongly) learns ip. Agent a still wrongly learns ^p and has no idea about what 
b learns". 

The case that agent b learns {Ni,s), denoted by L5((A^i,s)), is encoded by the pointed 
action model {N2,s) in Figure 2. 

Four different kinds of learning, in a multi-agent system, may be distinguished: 

1. Alternative Learning: a group of agents, B, together learns that among a set of 
actions (Mi, si), (M2, S2), ••., (M^, Sk), one of them is the actual action. 



Example 2.2 Let (Mi,si) be the pointed action model (A^i,si) in Figure 1, and 
(M2,S2) be another action model, in which M2 = {S = {s2},9,pre{s2) = x)- Then 
the pointed action model of Lb{{Mi, si), (M2, S2)) is 



KEnKSD" 



't 
Figure 4 

See Definition 13.21 for details. 



2. Concurrent Learning: two disjoint groups of agents Bi and B2 learn concurrently 
but not together; group Bi learns (A''i,si) and group B2 learns {N2,S2) concurrently 
but not together. 

Example 2.3 Let (Mi,si) be the pointed action model (A^i,si) in Figure 1, and 
(M2, S2) be another pointed action model, in which M2 = {S = {S2}, (s2 ^6 S2),pre{s2) 
if). Then the pointed action model of (Mi, si) n (M2, S2) would be 
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Figure 5 
See Definition [3^ for details. 



3. Wrong Learning: whereas a fact tp is announced, a group of agents, B, wrongly 
learns something else. 

Example 2.4 ip is announced and agent a learns ip. See the pointed action model 
(A''i, si) in Figure 1. 

4. Recursive Learning: a group of agents, Bi, learns what another group of agents, 
B2, learns, and group B2 learns what group Bi learns. 

Example 2.5 Consider the pointed action model (N^js) in Figure 3. 
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Figure 3 

The action model (N^js) means 
If is announced and agent b learns 

{ ip and the case that agent a (wrongly) learns 
{ip and about what b (wrongly) learns} 

}■ 

2.3 Epistemic Logic 

In this section we briefly go through the syntax and semantics of epistemic logic. The syntax 
of epistemic logic is as usual, but the semantics is a little bit different from the standard 
one. 

Definition 2.6 Let P be a non-empty set of propositional variables, and A be a set of 
agents. The language L{A,P) is the smallest superset of P such that 

if ifjil! £ L{A,P) then^ip, {(pAip), Knp £ L{A,P), 

for i £ A. 

For i £ A, Kif has to be read as 'agent i believes (knows) ip". For a group of agents 
B (^ A, Kb^ means that Knp, for all i £ B. 

Epistemic logic models the notions of knowledge and belief in terms of the notion of 
possible worlds in Kripke semantics. 

Definition 2.7 A Kripke model M is a tuple M = (S*, (^j)jg^, y), where S is a non- 
empty set of worlds (states) s £ S, V is a function from P to 2 , and each -^i is a binary 
accessibility relation between worlds. We define the group present at the state {M, s) as 
follows: 

gr{{M, s)) = {i£A\ {3t £S) s^it}. 

For an epistemic state {M,s), the set of applicable formulas at the state {M,s), denoted 
by ^(M,s) ^ -^(^)-P); is the .smallest subset satisfying the following conditions 

1. P^<^(M,s), 

2. if ^,^ £ ^(M,s) than V? A -0 G ^(M,s) and ^tp £ $(m,s); 



3. Kiif G ^(M,s) if C'l^d only if i & gr{(M, s)) and for all t such that s ^j t, (f € ^(M,t) ■ 

Intuitively, the applicable formulas of an epistemic state, are those formulas that can 
sensibly be assigned a truth value. For example, consider the Kripke model in Figure 6. 

Figure 6 

As agent b is not present in the world t, formulas like K^x are not applicable in the world 
t (it is not possible to talk about the truth of K^x in world t, where agent b is not present 
in this world). Also the formula Ka{Ki,py K^^p) is not applicable at the world s. In the 
next definition, we restrict the definition of truth to applicable formulas. 

Definition 2.8 In order to determine whether an applicable formula (p G ^(m,s) ^-^ true in 
the epistemic state {M,s), denoted by {M,s) |= if, we look at the structure of ip: 

{M,s)\=p iff seV{p) 

(M, s) 1= {lp a ^) iff (M, s)^if and (M, s) |= ip 

(M, s) \= -^p iff not (M, s) |= 99 ((M, s) ^ p) 

(M, s) 1= Knp iff for all t such that s ^i t, (M, t) |= p 

Note that the satisfaction relation is just defined for applicable formulas. 

The standard epistemic logic 5*5 consists of axioms Al — A5 and the derivation rules Rl 
and R2 given below 

Rl: \- ip, \- p -^ il^ ^ \- Ip 
R2: h p^ Kip, for all i e A 

Al: Axioms of propositional logic 

A2: {Kip A Ki{p -^ V)) ^ ^iV' 
A3: Kip -^ p 
A4: Kip -^ KiKip 
A5: -^Kip —7- Ki^Kip 

If instead of yl3, we assume the weaker axiom D (given below), the logic of belief KDAb 
will be specified. 

D:^{Kip^Ki^p). 

Definition 2.9 Let M = (5, (^a)aeA,^) be a Kripke model. For each a £ A, we say that 
the relation —^a is 

1) reflexive if and only if for all s £ S, s ^^ s; 

2) serial if and only if for all s £ S, there exists t £ S such that s -^a t; 

3) transitive if and only if for all s,t,u £ S, if s ^-a t and t ^-a u then s ^a u; 
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4) Euclidean if and only if for all three states s,t,u G S if s ^a t o-nd s ^a u then 
t --a u. 

We say a relation ^a is 

• S5 whenever it is reflexive, transitive and Euclidean, 

• KD4:5 whenever it is serial, transitive and Euclidean, and 

• K4:5 whenever it is transitive and Euclidean. 

A Kripke model M is called an 5*5 model if and only if for each agent a, ^a is S5. It is 
obvious that every 55 model is a model of the standard epistemic logic. A Kripke model M 
is called an KD45 model if and only if for each agent a, ^-a is KDAb. 

We also say a Kripke model M to be a KAf) model if and only if for each agent a, ^a 
is if 45. 

An action model N is called a if 45 model if and only if for each agent a, ^a is if 45, 
and it is an S'5 model if and only if for each agent a, -^a is 5*5. For example, the action 
model in Figure 2 is if 45. 

Definition 2.10 Let P he a set of atomic formulas, A he a set of agents, and ^ be a set 
of epistemic formulas over atomic formulas P and agents in A. 

We use FAct(<l>) (or simply FActJ to refer to the set of all pointed epistemic action 
models (N, t) such that N is a finite if 45 model, and the image of pre is ^. 

We use Mod{A, P) (or simply Modj to refer to the set of all epistemic states (M, s) 
such that M is if 45. 

We recall definition of bisimulation of actions from [^. Consider two action models 
N = {S, {^'a)a£A,pre) and N' = {S' ,{-^'f^)a<^A-,pfe'). The pointed action model (N, s) is 
bisimilar to (N',s'), denoted by (N, s) ~ (N',s'), whenever there is a relation TZ Q S x S' 
satisfying the following conditions, for each agent a £ A: 

Initial. TZ{s, s'). 

Forth. If TZ{t, t') and t ^a v, then there is a u' G S' such that Tl{v, v') and t' ^^ v' . 

Back. If lZ{t, t') and t' ^^ v' , then there is a v G S* such that lZ{v, v') and t ^a v. 

Pre. If TZ{t,t'), then pre{t) is equivalent to pre{t') in KDib belief logic. 

We define another notion of equivalence on action models and call it agent-hisimulation. 

Definition 2.11 Let a & A he an arbitrary agent. Two pointed action models (N, s) and 
(N',s') are a-hisimilar whenever 

Forth. If s ^a t, then there is a t' £ S' such that s' ^'^ t' and (N, f) ~ (N',t'). 

Back. If s' ^^ t' , then there is a t £ S such that s ^-a t and (N, f) ~ (N',t'). 



The execution of a pointed action model (N, t) £ FAct on an epistemic state {M, s) £ 
Mod is a new epistemic state (M * N, [s, t)), where M * N = (5, (^j)jgA, V) and 

S = {{si,ti) I si G N,ti G M,(M,si) |=pre(ti)}, 

(si, ti) ^i {S2,t2) iff si ^i S2 and ti ^j t2, 

{si,h)£V{p)iSsi€V{p). 

Proposition 2.12 Suppose (N, t) G FAct and (M, s) G Mod. T/ien (M * N, {s, t)) G Mod, 
i.e., it is a KAb model. 

Proof. It is straightforward. H 

2.4 Why KAb Models and Applicable Formulas? 

KAh Kripke models are more general than KDAb models, as they necessarily do not have 
the serial property. The reason that we consider serial property for KD45 model is that, we 
want the agent's belief to be consistent. For K4:5 Kripke models, we consider the definition 
of satisfaction relation just for applicable formulas. In this way, the agent's beliefs are 
consistent at each state. Moreover, 

regarding applicable formulas, the class of i^45 models is a (sound and complete) 
semantics for logic of belief KDAb. 

Assume that a formula ip is derivable from the logic of belief, i.e., KDAb h 99. Then for 
every KAb pointed model (M, s), if c/? is applicable at this state, then (M, s) |= if. Also, note 
that every KD45 model is also a Kib model. So for any formula (p, if for all i^45 pointed 
model (M, s) which 99 is applicable at the state, we have (M, s) |= cp then KDAf> h ip. 

Since i^45 models are more general than KDAb models, we can encode more epistemic 
functions in i^45 models, as we do not have to determine the cases where an agent does 
not have any idea about the belief of another agent. For example, consider the following 
information change: 

a green ball is shown to agents a and b, agent b sees a green ball and is aware 
that agent a has a color-blindness and sees a blue ball. Agent a just sees a blue 
ball and has no idea about what b sees. 

In the above information change, agent a has no idea about what b sees. Agent a says I 
do not have any idea, there could be lots of possibilities and I do not know even how many 
possibilities exist, may be an infinite number of them. It would be possible that agent b 
sees a cube instead of a ball, or even an elephant, and etc. It would be possible that agent 
b sees the ball in a color which is unknown for me. 

In KD4:5 models, we are forced to encode all possibilities, but if it happens that some 
possibilities are unknown, then we don't know what to do. However such information change 
can be encoded by the K45 action model in figure 2. 

Therefore, considering applicable formulas, the class of Kib models is still a semantics 
for logic of belief (similar to the class of KDAb models) and moreover, they are enough 
general than KDAb models for describing information changes formally. 

9 



3 Basic Learning Programs 

As mentioned in the introduction, our goal is to introduce some initial functions and basic 
operators as the building blocks of all finite epistemic functions (FAct). In this section, we 
introduce the first class of epistemic learning programs, called basic learning programs. The 
reason these are called basic is that they do not include any recursion in their structure. 

Definition 3.1 Let ^ be a set of epistemic formulas over a set of atomic formulas P and 
a set of agents A. The set of basic learning programs BLP(<1>) is defined as follows: 

i. Test, for all (p & ^, ?(p is a basic learning program, and we define group{?f) = 0, 
and pre(7if) = f, 

a. Alternative Learning, for all n £ N, and B C A, if ai, a2, ■■■,cXn are basic 
learning programs, then LB{ai,a2, ■■■,ctn) is a basic learning program, and we define 
group{LB{ai,a2, ■■■,an)) = B U group{ai) and pre{LB{ai,a2, ■■■,an)) =pre{ai). 

Hi. Concurrent Learning, if ai,a2 are basic learning programs such that pre{ai) = 
pre{a2) and group{ai)r\group{a2) = 0, then air\a2 is a basic learning program. We 
define group{ai n 02) = group{ai) U group{a2), and pre{ai n Q2) = pre{ai), 

iv. Wrong Learning, ifai is a basic learning program andtp is an epistemic formula, then 
^\b<^i is a basic learning program whenever B C group[ai) . We define group{ip\Bai) = 
B and pre{ip\Bai) = tp. 

To each basic learning program, we associate a pointed action model as follows. 

Definition 3.2 Semantics of BLP. 

1. for all f £ ^, the pointed action model of the program ?{p, is (N?^, s?^), where 

N?^ = ({s?^}, {^a)aeA,pre), 

in which for all a £ A, ^a= <ind pre{s?^) = ip. 

2. Suppose ai, 02, ..., Ofc ^'"e basic learning programs and their associated action models 
are {Ni, si), {N2, S2), ..., (Nk, Sk), where Ni = {Si,{^'-Ja£A,prei), for 1 < I < k, 
and rii<i<kSk = 0- Then the associated action model to the basic learning program 
LB{ai,a2,...,an) is {N,s), where N = {S,{^a)aeA,pre) and 

— S = {(Si, 1), (S2, 1), ..., {sk, 1)} U 5i U ^2 U ... U Sk, 

— for all b £ B, for all 1 < i,j < k, if Si is b-bisimilar to Sj then (sj, 1) -^h {sj, 1), 

— for all 1 < j < k, for all a £ gr{Nj,Sj) — B, for all t £ Sj, if Sj ^a t then 

[Sj, i) ^a t, 

— for all a £ A, for all 1 < i < k, for all v,t £ Si, if v -^l^ t then v ^a t, 

— s = (si,l), 

— pre{{si, 1)) = pre{si) for all 1 < I < k. 

10 



The associated action model ofLsiai, 02, ■■■, ««) means: pre{ai) is announced whereas 
agents in B learn that either qi or 02 or ... or a^ has been executed, and what other 
agents learn in each alternative case. 

3. Suppose ai,a2 are basic learning programs and their associated action models are 
(iVi,si) and (iV2,S2) respectively, where Ni = {Si, {^g^)a£Ai,prei) for 1 = 1 or I = 2, 
Ai n A2 = 9 and Si (1 S2 = 9- Then the pointed action model of the basic learning 
program a\ n 02 «s (-A^, -s) with N = {S, {^'a)a<=A,pre), where 

— S = {s} U 5*1 U 5*2, for some s ^ SiU S2, 

— for all 1 < i < 2, for all a G Ai, if v ^^ t then v ^a t, 

— for all 1 < J < 2, for all a £ Aj, if Sj ^i t then s ^^ t, 

— pre{s) = pre{si). 

The associated action model of ai n 02 means: pre{ai) is announced whereas agents 
in group{ai) learn according to execution of ai, agents in group{a2) learn according 
to execution of 02 ■ 

4. Suppose a is a basic learning program and its associated action model is {Ni,si), 
where Ni = (S*, (^^)ag^,prei). Then the associated action model of il)\BOt is {N,s) 
with N = {S, i^a)aeA,pre), where 

— S = {s} U 5*1, for some s Si, 

— for all a (z A, if v ^^ t then v ^a t, 

— for all b £ B, if si ^^ t then s ^b t, 

— pre{s) = ip. 

The associated action model ofiplsai means: ip is announced whereas agents in group 
B wrongly learn according to execution of ai . 

The action model of a program a is denoted by {Na,Sa)- An epistemic action (N, t) G 
FAct($) is called basic learning action whenever there is a basic learning programs a G 
BLP($) such that {Na,Sa) is bisimilar to (N, t). 

Example 3.3 The action model associated with the basic learning program 

Lb{^p\aLJ^p). 

is bisimilar to the action model illustrated in Figure 2. 

The action model illustrated in Figure 4 is a basic learning action. It is bisimilar to the 

action model associated with the basic learning program 

L5(v?|aLa?V',?x)- 
Also, the action model associated with the basic learning program 

iplaLa^lpriiplbLb^ip. 

is bisimilar with the action model illustrated in Figure 5. 
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Definition 3.4 Let N = {S, (^^a)aeA,pfe) be an K45 action model. 

• An S5 submodel of N is an S5 action model M = (S", (^^)agB,pre' 
pre' = pre\s', B <^ A and for all a £ B, —^'a=^a \s'- ^^ S5 submodel is called 
connected whenever all two different states of the model are reachable from each 
other. It is called closed whenever for all s,t £ S, if s £ S' and for some a G B, 
s ^a t, then t £ S' . 

• Let M' = (S", {^^)aeB-,P're') and M" = {S" , i^^a)aec,P^^") be two closed connected 
S5 action submodels of N . We write M' < M" whenever S' C S" and B <^ C. One 
may easily verify that < is a partial order relation. 

• Assume M* = (S"*, (^^)^g^i,pre*), 1 <i < k are all different maximal closed con- 
nected S5 submodel of N with respect to the partial order relation < . We construct 
the action model T{N) = {S' , {^'^)aeA,pre') as follows: 

-5' = Ui<.<fe(5^xW) 

— {s,i) ^^ {t,j) if CLiT-d only if either i = j and s ^^ t, or i ^ j, a B^, a £ B^ , 
and s -^a t (in the action model N). 

— pre'{{s,i)) =pre^{s). 

• The projection of a state {s,i) in the action model T{N), denoted by Il{{s,i)), is 
defined to be the state s in the action model N . 

• For the action model N , we define the directed graph of N , denoted by G{N) as 
follows: 

— Each node G{N) is a maximal closed connected S5 submodel M* of N . 

— Between two different nodes M^,M^, there exists a directed edge M* ^^ M^ 
if and only if there exists an accessibility relation in the action model T{N), 
{s,i) ^'^ {t,j), for some s G Si, a £ A, and t £ Sj. 

It is obvious that an 5*5 submodel of a K45 action model may be connected but not 
closed and vice versa. 

Proposition 3.5 Let N = {S, {—^a)aeA,P'>'£) be a KAb action model. Then {N,sq) is 
bisimilar to {T(N),wo) for all sq and wq for which the state sq is the projection of wq. 

Proof. See the Appendix. H 

Theorem 3.6 // a pointed KAb action model {N, s) is a basic learning action, then its 
graph G{N) is a tree. 

Proof. See the Appendix. H 

Example 3.7 Consider the action model {N^, s) shown in Figure 3. There is no basic 
learning program a such that its action model is (N^js). That is because G(iV3) is not a 
tree. 
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We showed that the graph of the basic learning actions are trees. The converse is also 
true, that is, for each iC45 pointed action model {N,s), if its graph is a tree then it is 
associated to a basic learning program up to bisimilarity. 

Proposition 3.8 All S5 pointed action models are basic learning actions. 

Proof. See the Appendix. H 



Theorem 3.9 // the graph of a finite K45 pointed model {N, s) is a finite tree, then {N, s) 
is a basic learning action. 

Proof. See the Appendix. H 



3.1 Comparing two Learning Operators 

In this part, we aim to compare our proposed learning operator with the learning operator 
introduced in [8j. We begin with the same example "Lecture or Amsterdam" discussed 
inlU. 

Anne and Bert are in a bar, sitting at a table. A messenger comes in and 
delivers a letter addressed to Anne. The letter contains either an invitation for 
a night out in Amsterdam or an obligation to give a lecture instead. Anne and 
Bert commonly know that these are the only alternatives. 

Consider the following information change scenario: 

• (spy-seeing). Bert says good bye to Anne and leaves the bar. During his leaving, he 
secretly spies from the window of the bar that whether Anne reads the letter, Anne 
does not get aware that Bert spies on her, and wrongly thinks that she is alone (Bert 
is not present) while she reads the letter. 

Suppose that p stands for "Anna is invited for a night out in Amsterdam", and also 
assume that in fact p is true. 

It is not possible to model the above information change scenario in concurrent dy- 
namic epistemic logic [8j. But we can model above scenario by the following basic learning 
programs. 

L;,(La?p, La?-.p). 

The pointed action model associated to the above learning program is {N,s), where S = 
{s,t,v,u}, s ^a V, V ^a V, t ^a u, u ^a u, and s ^b s, s ^fc t, t ^ft s, t ^b t, and 
pre{s) = pre{y) = p, pre{t) = pre{u) = ^p. 
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The pointed action model associated to Lb{La?p, La^^p) 

A candidate to describe the action "spy-seeing" in formalization presented in |8| could be 
Lh{\La'!pU La'^^p) . But the term Li){\La'!pU La'! ^p) is not a well-formed action in concurrent 
dynamic epistemic logic since it does not satisfy definition 6 of [8]. In this definition if Lsa 
is a well-formed action then gr{a) C B. Whereas, gr{La'!pU La'!^p) = {a}, and {a} ^ {b}. 
For another example, we discuss the following information change scenario: 

• (spy-reading). Bert says good bye to Anne and leaves the bar. using a hidden 
camera, Bert spies on Anne when she reads the letter, and Bert gets aware of the 
contents of the letter using the camera. Anne wrongly thinks that she is alone, and 
there is no spy on her. 

Again, it is not possible to model the above information change scenario in concurrent 
dynamic epistemic logic [8]. But we can model above scenario by the following basic learning 
programs Lh{La'!p). 



'QB 



p 

The pointed action model associated to Lb{LaTp) 

The pointed action model associated to the above learning program is {N,s), where S = 
{s, t}, s ^b s, s ^a t, t ^a t, and pre{s) = pre{t) = p. 

Again note that the term Li){La'!p) is not a well-formed action in concurrent dynamic 
epistemic logic as {a} ^ {b}. 

We can also model scenarios introduced in ^ via basic learning programs as follows: 



1- (tell). Anne read the letter aloud: Lab!p- 



a,b 

O 







The pointed action model associated to Lab^P 
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2- (read). Both Bert and Anne are present in the bar, sitting on a table, and Bert is 
seeing that Anne reads the letter: LabiLa'^p, La'f^p) . 

The pointed action model associated to Lab{LaTp,LaT~>p) 

3- (mayread) Bert orders a drink at the bar so that Anne may have read the letter (and 
actually Anne reads the letter) 

Lab(a,/3,7), 

where 



— a 



Lalp^p\bLiP.T (Anne reads the letter, and in fact p is true), 
La'^^pCi -ip|fcLfc?T (Anne reads the letter, and in fact -ip is true), 
La?T n Lb?T (Anne does not read the letter) 



- The pointed action model associated to a, denoted by (A''!, si), is 5 = {si,ti,vi}, 
si ^a h, Si ^b vi, h ^a h, vi ^b vi, and pre{si) = pre{ti) = p, pre{vi) = T. 

- The pointed action model associated to /3, denoted by (A^2, S2), is S" = {s2, ^2, "^2}, 
S2 -^a t2, S2 ^b V2, t2 ^a ^2, ^2 ^b V2, andpre{s2) = pre{t2) = -ip, pre{v2) = T. 
The pointed action model associated to /?, denoted by {N3, S3), is S" = {S3, ts, V3}, 
•S3 ^a h, S3 ^b V3, t3 ^a h, V3 ^b V3, and pre(s3) = pre{t3) = pre{v3) = T. 

One may easily check that for every i,j G {1,2,3}, {Ni,Si) is 6-bisimilar to {Nj,Sj). 
Also, for every i,j £ {1, 2, 3}, {Ni, Sj) is a-bisimilar to {Nj, Sj) if and only if i = j. 

Therefore, the action model associated to Labia, (3,^) is (A^, 1), where S = {1,2,3}, 
1 ^a 1, 2 ^a 2, 3 ^a 3, and 1 ^^ 1, 2 ^b 2, 3 ^b 3, and 1 ^f, 2, 2 ^b 3, 3 ^b 1, 
and pre(l) = p, pre{2) = -ip, and pre(3) = T. 

4- (bothmayread). Bert orders a drink at the bar while Anne goes to the bathroom. 
Both may have read the letter (and actually both of them have read). 

LabiLJp n Lb?p, LJ^p n Lb7^p, La?T D L^TT, LJp D plbL^TT, La^p D 
-.p|bLb?T, Lbfp n plaLJT, Lb^p n -.p|aLa?T). 

Four above information change scenarios are also modelled in concurrent dynamic epis- 
temic logic in example 7, of |8j. Also, in Figure 1 of [8], page 4, Epistemic states resulted 
from the execution of actions (for these scenarios) described in concurrent dynamic epis- 
temic logic (example 7, of [8]) is shown. It easy to verify that 

Epistemic states resulted from the execution of pointed action models associated 
to basic learning programs for these scenarios (introduced above) are exactly the 
same Epistemic states resulted from the execution actions described in concur- 
rent dynamic epistemic logic fE/ (and shown in Figure 1 of the same reference, 
in page 4)- 
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4 Learning by Recursion 

By learning by recursion, we mean the cases where agents learn about each other's learning, 
i.e., an agent a learns something about learning of another agent b and agent b also learns 
about learning of agent a. In this way, a recursive learning occurs. In this section, we 
introduce recursive learning actions to model this type of learning. 

In the following definition, undf, indicates that the term is undefined. 

Definition 4.1 Let ^ be a set of epistemic formulas over a set of atomic formulas P and 
a set of agents A, and Var = {X,Y, Z, ...} be a set of variables. The set of open terms 
OpenT($) is defined as follows: 

1. for all X G Var, X is an open term, we let pre{X) = undf and group{X) = undf , 

2. for all if ^^, lip is an open term, group{lip) = 0, and pre{?ip) = ip, 

3. for all n E N, and B (^ A, if ai,a2, ■■■,an are open terms, then LB{ai,a2, ■■■,an) is 
an open term, and we let 

group{LB{ai, 02, •••, On)) = B U group{ai), and 
pre{LB{ai, a2, ■■■, an)) = pre{ai) 

Note that the left sides are defined if group{ai) and pre{ai) are defined. 

4. ai n 02 is an open term whenever ai and 02 are open terms. If both group{ai) , 
group{a2) are defined and group{ai)rigroup{a2) = 0, and if both pre{ai) , pre{a2) are 
defined and pre{ai) = pre{a2). Then we let group{a\ n a2) = group{ai) U group{a2) 
and pre{ai n a2) = pre{ai), 

5. V'l-BCKi ^s an open term whenever ai is an open term. If group{ai) is defined and 
B C group{ai) . We then define group{tp\BCii) = B and pre{ip\BCti) = V^- 

Definition 4.2 Assume a(Xi,X2, ...,X^.) is an open term with k variables, the tuple 

(/3i,/32,...,/3fc)GOpenT($) 

is a suitable substitution for a{Xi,X2, ...,Xk), whenever a{Pi, (32, ..., (3k) G OpenT(<I>). 

Now we define open action models, in order to be associated to open terms. For each vari- 
able X, let {Nx, sx) be a variable pointed action model, where Nx = {S^ , {-^-^)a<^A,pfe^) 
and the set Sx, the relations ^^ > ^^^ the function pre are variables. The open action 
model of an open term a(Xi, X2, ...,Xfc) is simply constructed using variable pointed ac- 



tion models (iVxi,sxi), {XX21SX2), •••> and {Nx^^sx^) aiid Definition 3.2 Substituting an 



action model {N, s) in an open term a{X) is obtained by considering s instead of sx- 

Example 4.3 Suppose A = {a, 6}, and consider the open term LhLa{X). The open action 
model {Xi^i^(^x)j SLi,La{x)) is constructed as follows. We consider three states sq, si and sx, 
where pre{so) = pre{si) = pre{sx)- The actual state is sq, and the accessibility relations 
are sq ^^ sq, sq ^a si, si ^a si, and , si ^j, t for all t G &[sx], where b[sx] is the set of 
all states in the hypothetical model Nx in which sx has 6-accessibility to them. The open 
action model of the term LhLa{X) is illustrated in Figure 8. 
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■si 



A^ 



A' 



/ 


Sx 


\ 


b[sx] 



Figure 8 

1. Let (M, to) be the action model of the term Lab{'^^), and S = {to}, pre(to) = ^, to ^a to 
and to ^b to- 

a,b 

o 



Figure 9 

Substitution of (M, to) for {Nx,sx) means to substitute to for sx- Therefore, pre{sx) = 
pre{to) = if and b[sx] = b[to] = {to}. Substituting (M,to) in (A^L6L,(x), ■sl^l^(x)) yields to 
the following action model 




so 



'P 



«Q® 



Sl 



Figure 10 
which is the action model of LaLbLabi^f)- 

Example 4.4 Suppose A = {a,b}, and consider the open term LbiiflaLaiiplbLbiX))). The 
open action model {NL,(^\^L4i,\,L,{x))),SL,(^i^La{i>\tLt{x)))) is shown in Figure 11. 
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LMNx,^-x)) 




^Lbi'M^-aii'lbLiiX)))) 



Figure 11 

i^Lt((Nx,sx))^ ^Li,{(Nx,sx))) ^^ obtained by adding the new state (sx,l) as the actual 
state and adding new accessibihty relations {sxA) ^b {sxA), and (sx,l) ^a t, for all 
t £ a[sx]- We have pre{{sx, 1)) = pre{sx), and sl^CCA^x.^x)) = i^x, 1)- 



{N^(,\iLiix),s^\tL,(x)) is obtained by adding the new state s^i^l^CX) to iVL,((7v 



x,sx))' 



and 



adding new accessibility relations s^|j^2.b(x) ^fe ^i foi' all t G H^Lt{(Nx,sx))^- Moreover, 

Continuing the above scenario and using Definition 3.2 the open action model in Fig- 
ure 11 is constructed. 

To obtain the open action model of Lb{ip\aLa{tp\bLb{Lf,{ip\aLa{'ip\bLb{X)))))), one may 
simply consider two copies of the open action model (A''/^j((p|^L^{^|jLj,(x))) and replace the 
actual state si^(^\^La{ip\bLb{x))) of one of them for sx in another one, and obtain the model 
shown in Figure 12. By substituting SL^(^^\^L4i,\kL,{x))) for sx, the set a[sx] is {s^\i,Lk{x)}- 
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SL,{^\^La{i.\,Li,(Li,{<^'\aL,{^'\iLf,(Xm)}} 



N 




Lb{'f\aLa(i'\bLb{X))) 



Lt{{Nx,sx}) 



Figure 12 

Fixed point of Li,{ip\aLa{il^\bLh{X))) . To construct an action model as a fixed point 
of the term Lb{ip\aLaiijj\bLb{X))), we consider the open action model (A''Li,(<^ULa(v|6L6(x))) 
and replace its actual state SL5((/3|aLa(i/'|6i6(^))) ^°^ ^^ (^^^ Figure 13, the symbol s^X will 
be defined in the next section). 



<® 



^nXXii'-plaLaivlbi-iiX))) 



IP U— ^ V-' 



Figure 13 
It is easy to verify that the pointed action model in Figure 13 is bisimilar to the pointed 



action model {N2„s) explained in Example 2.5 as a recursive learning action. 
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4.1 Recursive Learning Programs 

One may check that the graph of the action model shown in Figure 3 is not a tree and 



thus by Theroem 3.6, it is not a basic learning action. So we cannot describe it in terms 
of alternative learning, L^( — ,—,..., —), concurrent learning n, and wrong learning, 1^, 
operators. We add a new operator fi to the language for recursive learning and show that 
the action model shown in Figure 3 is a recursive learning action. To do this, we need to 
slightly modify the Definition of open terms |4.1| as follows. 



Definition 4.5 Let ^ be a set of epistemic formulas over a set of atomic formulas P and 
a set of agents A, and Var = {X, Y, Z, ...} be a set of variables. The set of open terms, 
0T($) is defined as follows: 

1. for all X € Var, X is an open term, we let pre{X) = undf and group{X) = undf , 

2. for all if G L{^), lip is an open term, group{?ip) = 0, and pre{?ip) = ip, 

3. forallnGN, and B (1 A, i/ 01,02, ...,«« are open terms then Lb {ai,a2, ■■■, an) is 
an open term, and we let 

group{LB{ai, 02, .-., a„)) = B U group{ai) and, 
pre{LB{ai,a2, ■■■, On)) = pre{ai) 

Note that the left sides are defined if group{ai) and pre{ai) are defined. 

4. oi n 02 is an open term whenever oi and 02 are open terms. If both group{ai) , 
group{a2) are defined and group{ai)r\group{a2) = 0, and if both pre{ai) , pre{a2) are 
defined and pre(oi) = pre{a2). Then we let group{ai Ci 02) = group{ai) U group{a2) 
, and pre{ai n 02) = pre(oi), 

5. iplsoi is an open term whenever oi is an open term. If group{ai) is defined and 
B C group{ai). We define group{jp\BCei) = B and pre^ijjlBCei) = ip. 

6. If a{Xi,X2, ...jXi;.) is an open term, and both 

group{a{Xi,X2,...,Xk)) and 
pre{a{Xi,X2,...,Xk)) 

are defined, then fiXi.a{Xi,X2, ...,Xk) is an open term with k — 1 free variables; the 
variable Xi is bound under fiXi . We define 

group{^iX.a{Xi,X2,...,Xk)) = group{a{Xi,X2, ...,Xk)) and 
pre{nX.a{Xi,X2, ..., Xk)) = pre{a{Xi,X2, ..., X^)). 

A term a S 0T($) is closed if it has no unbounded variable. 

Definition 4.6 Let ^ be a set of epistemic formulas. The set of recursive learning pro- 
grams, RLP($), is the set of all closed terms in OT(<l>). 
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Definition 4.7 ('Semantics of /iX.a(X)/ The associated pointed action model to fiX.a{X) 
is obtained by replacing the actual state Sa{x) of the action model Na(x) for the state sx ■ 

The associated action model of fj,X.a{X) is actually a fixed point of the open action 



model of a{X), see Example 4.4 



Example 4.8 The semantics of the recursive learning program 

a := fiXi.L,{x\afiX2.La{ip\bLbmaX2) n {^P\cXl)))) 
is constructed in the following way. The model of the open term 

La{ip\bLbmaX2) n i^lcX^))) 

is shown in Figure 14. 



(S 



^La(,'fi\bLim^X2)nli,\cXi))) 



a. isxs 



c[sx,\ 



Figure 14 



The model of the open term 



llX2.Laiip\bLbmaX2) H (V'lc^l))) 



is shown in Figure 15. 



^ilX2.La('f\bLt,i(li>\aX2)n{iicXl))) 



4-^xA 



Figure 15 
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The action model of 

Lc{x\af^X2.La{^\bLbmaX2) H (Vlc^l)))) 

is illustrated in Figure 16. 



^f(X2.La(^HtLb((*UX2)n(li|eXl))) 



•^[^A-i] 



Figure 16 
Finally the action model of the recursive learning program a is shown in Figure 17. 




b 







Figure 17 

As two other examples, one may check that the action model associated to the recursive 
learning program 

is the pointed action model illustrated in Figure 18, 
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Si.,X.L„(.f\l,I^Y.Li{i,\aXni:\cL^(e\iY))) 



Figure 18 
and the action model associated to the recursive learning program 

flX.La{(p\bLb{(p\aX),1p\bLb{ip\aX)) 

is bisimilar to the pointed action model {N, s) illustrated in Figure 19. 

{N, s) 

Figure 19 

4.2 Two Main Theorems 

In this subsection, we present two main theorems of the paper. In the first one, Theorem 



4.12, we show that every finite K45 action model is associated to some recursive learning 
program and conversely, for every learning program, there is a finite Kib action model 
associated with it. Then we introduce a hierarchy over learning programs with respect to 



the number of recursive learning operators. In our second main Theorem 4.15, it is shown 
that the hierarchy of the learning programs is strict, i.e., it is not possible to describe all 
K4:5 action models by a determined finite number of recursion in learning. That means 
that the hierarchy does not collapse. 

4.2.1 Representing Epistemic Action Models 



Definition 4.9 Let N = {S, 
an a- component of N is Ma 
submodel of N. 



^a)aeA,P^G) be a KAb action model. For each agent a ^ A, 
(S", (^'^)Qg|Q|,pre'), where Ma is an S5 closed connected 



Assume for all a € A, Ma = (5'*,( aja 

a-component of N . We construct the action model T'{N) 

follows: 



* "* g{a},P''eJj), 1 < i < ka are all different 

{S',{^'a)a€A,pre') as 



- S' = [jaeAUl<^<k^S:x{{^,a)}), 

— (s, (i,a)) ^ {t, {j,b)) if and only if either i 
b ^ a, and s -^b t (in the action model N ), 



j, a = b and s -^a ^' or i ^ j, 
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- pre'{{s,{i,a))) =pre\{s). 

The projection of an state (s, {i,a)) in the action modelT'[N), denoted hyYi{{s, {i, a))), 
is defined to be the state s in the action model N . 

Example 4.10 Consider the action models A^i and A'^2 in Figure 20. Their action models 
r'(iVi) and T'{N2) are illustrated in Figure 20 as well. 






N2 



<£)^^-^£ba 




r'(A'i) 



Figure 20 



&)^ 



T'{N2) 



Proposition 4.11 Let N = {S,{^^a)aeA,pre) be an K45 action model. Then {N,so) is 
bisimilar to {T'{N),wq) for all sq and wq, in which the state sq is the projection of wq. 



Proof. The proof is similar to the proof of Proposition 3.5 H 



Theorem 4.12 All finite epistemic actions are recursive learning programs, i.e., 

FAct($) = RLP($). 

Proof. See the Appendix. H 



Example 4.13 It is shown in Figure 21, how to construct a program for the pointed action 



model (A^i, s) through the instruction argued in proof of the Theorem 4.12 
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Figure 21 



4.2.2 A Hierarchy of Learning 



Definition 4.14 For each k & N, we define the class kRLP, of all finite KAf> pointed 
action models which can he described by a recursive learning program with at most k times 
of dependent use of the recursive operator /i. 

The term ^ dependent^ in the above definition is crucial. In a program, two operators ^X 
and fjY are called to be dependent if it is not possible to use one variable for both operators 
and achieve the same action model. 
For example, in the program, 

{x\af^X.La{^\hLh{ij\aX))) H {x\bf^Y.Lt{ij\aLa{ip\bY)), 

the operators /iX and fiY are independent. Note that the program 

(xlafJ'Z.Lai^lbLbiTplaZ))) D {x\bl^Z.Lb{lp\aLa{ip\bZ)), 

describes the same action model. In contrast with the above example, in the following 
program, 

liX.Laiip\bf^Y.Lb{4^\aX n tl;UL,{e\bY))), 

the operators iJ,X and fiY are dependent. 

It is easy to observe that the class ORLP is the class of all basic learning programs, 
BLP. We also wish to name IRLP as the class of primitive recursive learning programs and 
denote it also by PRLP. 

To prove the next theorem, we need to clarify some notions in graph theory. Let 
G = {N, E) be a directed graph. A simple loop L in the graph G is a sequence of 
nodes (sq, si, S2, •••, ^n), such that for all i < n, (si,Sj+i) G E, {sn,so) £ E, and for all 
< i, j < n, if z 7^ j then Sj ^ Sj. We call sq the start-point of the simple loop L. Let 
L = {so,si,S2, ■■■,Sn) and L' = {sq,s'i,S2,...,s'^) be two simple loops. We say L' is con- 
nected to L by its start-point, if there exists < t < n, St ^ sq and st = s'q, and for all 
< i < n and 0<j<mili^t and j 7^ 0, then Sj / Sj. A k-nested loop is a sequence 
(Li,L2, ...,Lfc) of simple loops, such that for each i > 1, Li is connected to Li-i by its 
start-point. 
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Theorem 4.15 We have the following hierarchy of recursive learning programs 

BLP £ PRLP £ 2RLP £ ... £ kRLP £ (k + 1)RLP £ .... 
Moreover, it does not collapse to RLP, i.e, for all k, 

kRLP / RLP, 

Proof. See the Appendix. H 

5 Concluding Remarks and Further Work 

5.1 Related Works 

We may compare epistemic learning programs with other approaches, hke concurrent dy- 
namic epistemic logic [8j and epistemic programs . 

In concurrent dynamic epistemic logic [8] , an epistemic action is interpreted as a relation 
between 5*5 epistemic states and sets of S5 epistemic states. There are two main differences 
between the interpretation of epistemic action in concurrent dynamic epistemic logic and 
epistemic learning programs. 

1. An epistemic action in concurrent dynamic epistemic logic is a relation between epis- 
temic states whereas in epistemic learning programs, it is a function from epistemic 
states to epistemic states. 

2. Concurrent dynamic epistemic logic is just about S5 models whereas epistemic learn- 
ing programs also considers KAb models. 

Another difference is in the interpretation of the notion of learning. Our learning oper- 
ator is an operator on action models, and ^^(ai, 02, •••, Ofc) is a new action model, express- 
ing the condition that agents in B learn that an action among 01,02, ■■-,0^ has occurred, 
whereas the action ai has actually occurred. For example, Li,{?ip, ?ip) is an action model 
which says: 

(f is announced and agent b is suspicious whether he learns (/? or learns ip. 

One may compare the above learning program with the action Li,{\?ipL)?'tp) in dynamic 
epistemic logic, and observe that for all 5*5 epistemic state {M,s), we have that {M,s) x 
(-^Lj,(?(/5,?V')'*i6(?</',?V')) ^^ bisimilar to (M, s)[[Lft(!?(/pU?V')]]- However, note that the learning 
program Lfe(Lf,(?(/9, ?V')) is equal to the learning program Lh(?ip), whereas, in dynamic epis- 
temic logic, the action Lb{Lb{\?ifU7tp)) is equal to -L;,(!?(/9U?'0)- Thus there is a difference 
between the notion of learning we consider for learning programs and the notion of learning 
considered in dynamic epistemic logic. 

Despite of the above arguments, it seems possible to translate a class of action terms, 
say a, in concurrent dynamic epistemic logic to a recursive learning program tr(Q), such 
that for all S5 epistemic state (M, s), we have that {M,s) x {N^^j-{a) : hria)) is bisimilar to 
{M,s)[[o]]. 
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Another way to represent information change is via the notion of episteniic program 
introduced in [5] . Whereby the notion of action signature is introduced and by adding this 
notion to the propositional dynamic logic PDL [13l [3] , a logical language is obtained to 
represent information change. However in this setting, no learning operator is considered, 
and the information change is represented through action signature, alternative, sequential, 
and iteration compositions. We focus on different kinds of learning; as the primitive notion 
of information change is learning something by agents. 

Another work related to ours is [lOj where the epistemic programs are discussed by 
adding a parallel composition operator to non-deterministic sum and sequential composi- 
tion. 

We showed that all finite X45 action models can be described by recursive learning 
programs. It is also announced in [6] that every S5 action model can be described as a 
concurrent epistemic action. 

We showed that Kib models are models of the belief KDAh logic for applicable formulas. 
In this way, to preserve the belief consistency of an agent, the agent is absent in the states 
that conflicts his beliefs. A similar work has been done in |16j . which assumes that a rational 
agent rejects those incoming information which dispute his beliefs. 

By introducing KAb models and actions, we may think of a theory of multi-agent belief 
revision. A related work is \2\, which generalize AGM [T|, to a multi-agent belief revision 
theory. 

Our work presents a method to construct KAb action models through some basic con- 
structors. Also in |llj . it is introduced operators to compose epistemic models in order 
to construct large models by small components representing agents' partial observational 
information. 

5.2 Further work 

5.2.1 A functional Semantics 

As a semantics of epistemic learning programs, we associated a pointed action model to 
every basic learning program. We may propose a functional semantics for the basic learning 
programs, in the manner that each program is associated to a partial function from epistemic 
states Mod to Mod. In this semantics, the meaning of learning operator is different form 
the meaning we discussed in the introduction. Here, learning in epistemic states (M, s) 
deals with two things, a set U of states of M. which includes the actual state s, and a set 
of agents B (^ A (where A refers to the set of all agents). Learning with B C. A and U (^ S 
in the epistemic states (M, s) means that: 

agents in B become aware that the actual state is among the states in U , and 
other agents in A — B believe that nothing has occurred. 

Let $ be a set of epistemic formulas over a set of atomic formulas P and a set of agents 
A. To each a G BLP{^), we associate a pair {fa^Ua), where fa ■ Mod — )• Mod is a 
partial function ,and for each epistemic state {M,s), Ua{{M,s)) is a subset of S", where 
fa{{M,s)) = {M',s') with M' = {S',{^[)i^AX)- 

For a recursive learning program iiX.a{X), the associated partial function should sat- 
isfy the fixed point equation, i.e., f^x.a(x) = fa ° f^iX.a(x)- As our forthcoming work, 
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we aim to study this functional semantics. It seems to us that functional semantics and 
recursive learning take us beyond the action models, that is, by functional semantics, epis- 
temic learning programs can encodes information changes which cannot be encode by action 
models. 

5.2.2 A Logic for RLP 

We need to provide a proof system for RLP as it is done for other approaches, like concurrent 
dynamic epistemic logic [H] , and action models [H [3] . 

5.2.3 Notions of Learning 



In Introduction, we put forward two meanings for 1. pointed action models (see 2.1) and 



2. learning of an action model (see 2.2). We supposed that an action model describes 
what is announced and what agents perceive based on their accesses to the resource of 
announcement. We also assumed that the learning of an action by a set of agents is to learn 
about the way information change. So our meanings of action models and learning refer to 
the occurrence of information change. 

We may propose two other meanings for pointed action models and learning of an action 
model, which refer to disability in information change. In this way, an action model describes 
the disability of agents in hearing or accessing the resource of announcement. For example, 
the new meaning of the pointed action model (A'^i, s) in Figure 1, 



© 



is 



Figure 1 

"in the case of announcement of f agent a hears ^. " 

Note that the above meaning does not speak about what occurs in information change, but 
it just describes a disability of agent a. Suppose ip = green and ip = blue. The new meaning 
of the pointed action model (A'^i, s) is that agent a has a color-blindness and if a green ball 
is shown to her then she thinks that she sees a blue ball. Similarly, the meaning of learning 
an action changes. The new meaning is learning about disability not about occurrence. The 
learning of an action by a set of agents is to learn about the disabilities that the agents 
have. Figure 22 shows two pointed action models where both refer to La{{Ni,s)) (agent o 
learns the pointed action model (A'^i, s)), but one considers the occurrence meaning and the 
other considers the disability meaning. 
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occurrence disability 

Figure 22 

In the occurrence meaning^ agent a learns that (/? is announced. In the disability meaning^ 
agent a becomes aware of her color-blindness, and after this learning, if she sees a blue ball, 
she is suspicious whether it is green or blue. 
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6 Appendix 

Lemma 6.1 Let M = {S, {^■a)aeA, y) be a Kripke model such that for each a £ A, ^^ is 
Euclidean. Then for all s £ S, if there is a state v £ S such that v ^a s then there exists 
t £ S such that s ^a t. 

Proof. Let S be a set of states and ^'C S x 5 be an Euclidean relation. Suppose s,v £ S 
are arbitrary and v ^- s. By Euclidean property, we derive s ^- s, and we are done. H 



Proof 3.5, Consider T{N) = {S' ,{^'JaeA,pre'). Define TZ C S x S' as follows. For all 



s £ S and w £ S' , 

sTZw if and only if n(ti;) = s. 

We show that 7^ is a bisimulation relation. Suppose all different the maximal closed con- 
nected 5*5 submodels of A^ are M^,M'^, ... and M^. Assume sTZw. Then w = {s,i), for 
some 1 < i < k. 

- Forth. Let s ^-a t. Either a £ B^ or a ^ B^. In the first case, since M* is closed, we 
have t £ S^ and thus {s,i) ^^ {t,i), and since tTZ{t,i), we are done. In the second 



case, since A^ is a KAf) model, and s -^a t, we have t ^^ t, by Lemma 6.1 Therefore 



there exists a maximal closed connected 5*5 submodel of A^, say M^ , such that t £ S^ 



and a £ BK By Definition 3.4, {s,i) -^'^^ {t,j), and since tTZ{t,j) we are done. 

- Back. Suppose {s,i) ^^'^ {t,j)- Then by Definition |3.4[ s ^a t, and we are done. 

- Pre. It is straightforward. 

30 



H 

Proof J3T6| The proof is by induction on the structure of basic learning programs. 

First of ah, note that for each epistemic formula (^, the graph of the action model 
(A''?(p, s?ip) is a tree. 

Let a be a basic learning program and its graph G{Na) be a tree. We show that the graph 
G{N^i^) is a tree, for any arbitrary formula ip and B C group{a). The maximal closed 
connected 5*5 submodels of the action model N^i^ are all the maximal closed connected 
55 submodels of Na, say M^, M^, ..., M^, and the maximal closed connected S5 submodel 
containing the state s^i^a, which is M° = ({s^|sa}, (^a)ae0;P'^e°(s^|g„) = i/j). One may 
check that 

1. for all i,j > 1, the edges between two nodes M^,M^ in the graph G(A^^|g„) are the 
same edges in the graph G{Na), 

2. for all i > 1, there is no directed edge from M* to M^, as group of M^ is empty, 

3. for ah z > 1, M° ^ M* if and only if there exists t e S\ a G B' D B, such that 
Sa ^a t in the model Na. 

Hence, if the graph G{Na) has no loop, the graph of G{N^\ga) would have no loop as well. 

Let ai and 02 be two basic learning programs such that group{ai) n group{a2) = 0, 
and pre{ai) = pre{a2) and G(A^aj) and G{Na^) are trees. Then the graph G{Nci^f^a2) is 
a tree for the following reasons. The maximal closed connected 55 submodels of Nair\a2 
consist of all the maximal closed connected 55 submodels of N^^^ and all the maximal 
closed connected 55 submodels of A^ai, and the maximal closed connected 55 submodel 
containing the state So-inoa, which is Af° = ({sainaa}, (^a)ae0)P'^e°(saina2) = pre{ai)). 
One may check that, since group{ai) n group{a2) = 0, there is no edge between the nodes 
of the subtrees G^Na^) and G{Na2)- So if the graphs G{Na^) and G[Na2) have no loop, 
the graph of G{Nct^r\a2) would have no loop as well. 

Assume ai, 02 j ••• and Um are basic learning programs such that their graphs have no 
loop. The the maximal closed connected 55 submodels oi Ni^g(^ai,a2,...,am) ^'^^ ^^^ followings: 

1. the 55 model Mq consists of m states {sa-^^., 1), {sa2i l), •••, (■Sq^, 1), with accessibility 



relations produced by agent-bisimilarity of group B (see Definition 3.2). 
2. all the maximal closed connected 55 submodels of Na^, Na^, ..., Na^. 

The node Mq is the root of the graph G{Ni^g(^ax,a2,...,am))-> ^^"^ ^^^ the graphs G{Nai), 
G{Na^), ..., G{Na^) are disjoint subgraphs of G(A''^^(„jQ,2,,,^Q,^)), such that the root may be 
connected to them. Hence if G(A''q,^), G{Na2), ■■■, G{Na^) are trees, then G{Nig(^ai,a2,...,a,n)) 
is a tree. H 



Proof sTsj Suppose {N,sq) is an 55 pointed action model. Let N = {S,{^^a)aeA,pi"e), 



where 5 = {sq, si, ..., Sfc}) and A = {oi, 02, ...,am}- As the model is 55, all accessibility 
relations are equivalence relations. For each agent aj, let Pi = {Dj,Df,...,D^''} be the 
equivalence classes of the relation ^a. , which partitions the set 5. Consider rii epistemic 
formulas i^i,i,tpi,2, ■■■,ipi,ni, where none of them are KD45 equivalent to each other. For 
each Sj, < j < k, consider the basic learning program Oj 
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where each Pj^i is a basic learning program defined as follows 

where Sj S -D[^. 

The action model associated to the basic learning program L^(ao,ai,a25 •••^Ofc) is 
{N, So). Since for each agent a, the action models of two programs aj and Oj are a-bisimilar 
if and only if Sj and s, are in the same equivalence classes induced by the relation -^a- ^ 



Proof 3.9 , By induction on the height of the tree. Let h^ be the height of the tree G{N). 



If Hn = 1, the action model A^ is an 5*5 model and by the Proposition 3.8, {N,s) is a 
basic learning action. Suppose that for all h < k, ii the graph of a K4:5 pointed model 
(M, t) is a tree with height h, then (M, t) is a basic learning action. Assume h^ = k. Let 
M^,M^, ...,M^~^ are all maximal connected closed submodels of N, and s is an state of 
M^. We consider M^ as the root of the tree, and delete all nodes which are not reachable 
from M^. That is because we want to state a program for the pointed action model (A^, s), 
and by deleting those maximal connected closed submodels which are not reachable from 
s, we do not loose anything up to bisimilarity. Let M^ = (5'°, (^a)ae_BO)P^e''), where 
5" = {s'l, $2, ..., s^}, B^ = {a°, 02, ..., flml ^^d s^ = s. As M" is an 55 action model, by using 
Proposition |3.8| there is a basic learning program LQo{ai,a2, ■■■,an), such that its action 



model is (Af'^s^), and each a^ corresponds to a state s^ (see the proof of Proposition 3.8). 
For each s^, let {b\,b2, ...,^J.} = g'''oup']r'(^]^^{s^) — B^, where groupT[N){s^) is the group of 
agents of the state s^ in the model T{N). For each 1 < j < Zj, if s^ ^^i tjj, for some state 
tjj of the model T{N), then as Mq is the root and is not accessible from tj, the graph of 
the pointed action model {T{N),tij) is a tree with height less than k, and by the induction 
hypothesis, there is a basic leaning program f3ij such that its action model is {T(N),tj). 
For each s?, let 

7, = Oi n (pre(sO) |fe.A,i) n (pre(sO)|^A,2) n ... n {pre{s'^)\^^(3^J. 

The program L^o{-yi,^2, ■■■,7n) is the desired program, that is, its associated action model 

is {N,s). H 



ProofJ4.12 Let {N,so) be a K45 pointed action model. Consider the action model 



T'{N). One may check that the followings hold true. Suppose Mf and M2 are two different 
components in T'{N), then 

1. if a = 6, as both components are closed S5 models, there is no accessibility relation 
for agent a between the two components, 

2. if for some state s G Mf and t € Mg, we have s ^b t, then for all v G Mg, we have 
s ^f, V, by transitivity and connectedness of M2. 

Let no,ni,n2, ...,nfc be all different components of the model N. Also suppose no is a 
component in which the actual state sq appears. The model T'{N) is a directed labeled 
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graph in which the nodes are no, ni, n2, ..., n^, and the edges are agents in ^. To each node 
nj, we correspond a variable Xj. 

If the graph is a tree, then we are done and then we can construct a basic learning 
program describing (N, sq). If the graph is not a tree, we unwind it to an infinite tree with 
the root uq. 
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Figure 23 

In the unwound infinite tree, there could be infinite nodes with the same name, say Uj. For 
all nodes w of the unwound tree, if ti; is a node with name Uj (for some i) and exactly one 
of its parents has the same name Uj , then we cut the subtree rooted from w and change the 
name of w from Uj to variable Xj. In this way, a finite tree T"{N) is obtained. 

Now we are ready to construct the desired program. We start from down to the top of 
the finite tree T"{N). 

1. First note that each leaf of the tree is either a variable or an a-component. If it is an 
o-component, then we associate to that leaf, the program 

La(?pre(fi), ?pre{v2), ■■■, ?pre{vm), 

where vi,V2, ■■■,Vm are all the states of the component. We note that as the a- 
component is a connected S5 action model, it is associated to the program 

La(7pre{vi), ?pre{v2), ...,pre{vm)- 

2. Suppose that Uj is the name of a node w in T"(N), which either all of its children are 
corresponded to a variable or a program. Two cases are possible: 

— Case 1 . Among the children of w there is no node corresponding to the variable 
with the same index j, that is Xj. 

For this case, suppose n^ refers to a 6-component with the states vi,V2, ■■■,Vm- 
For each state vi, and each agent a £ A, ii there is a directed edge with label a 
starting from the state vi to a children of w, say u, in the tree T"{N), consider 

pre{vi)\aPa,i 

where Pa,i is a program or variable corresponding to the node u. Then we 
associate to the node w, the program 
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LbinaeAPre{vi)\aPa,l,naeAPr4v2)\aPa,2,---,naeAPreiVm)\aPa,m) 

— Case 2. Among the children of n^ there are some nodes corresponding to the 
variable with the same index j, that is Xj. For this case, we do exactly the same 
as we did in the first case, thus obtaining the program 

Lb{r\aeAPM'"i)\aPa,l,f]aeAPr4'"2)\aPa,2,---,f]aeAPMVm)\aPa,m)- 

Then we associate the following program to the node 

IJ,Xj.Lbin^^j^pre{vi)\aPa,l,f]aeAP^^i'"2)\aPa,2,---,f]aeAP^(^i'"ni)\aPa,m) 

The program corresponding to the root of the tree T"{N) is a recursive learning program 
which describes pointed action model (A'^, sq). H 



Proof J4.15 In Theorem 3.6, it is proved that the graph of the action model of a basic 



learning program is a tree. So none of the operations: alternative learning, concurrent 
learning, wrong learning, produces any loops in the graph of a learning program. It is 



easily seen by Definition 4.7 that, the only operation that makes loops in the semantics of 
a learning program is the recursive learning operator. Therefore, if in a learning program, 
there exist k times of dependent use of the recursive operator /i, then there exists at most a 
/c-nested loop in its graph. That is, for each k €N, the graph of an action model associated 
to a program in kRLP has at most fc- nested loops. 

For each k > 0, we introduce a learning program a'^, such that its associated action model 
belongs to kRLP but not (k - 1)RLP. 

• k = 1. Let a^ = fiX .Lf,{ip\aLa{Tp\bLb{X))) , where ip is not logically equivalent to ip. 
The associated action model of the learning program a^ is the action model (N^, s) in 
Figure 3. Since (/? and ip are not logically equivalent, the two states {Ns, s) and {Ns,t) 
(see Figure 3) are not bisimilar. If there exists a program /3 without any recursive 
operator that its associated action model is bisimlar to (N^, s), then the action model 
A^3 would be bisimilar to a finite action model M' G FAct, such that its graph is a 
tree. Suppose -R is a bisimilarity relation between N^ and M', and sRs' . Because of 
bisimilarity, since s —^a t, there exists an state t' in model M' , such that tRt' and 
s' ^-'^ t' . Again, since t ^b s, there exists an state s" in M' , such that sRs" and 
t' ^^ s" . The model M' is a tree, so we have s" ^ s' , and as s and t are not bisimilar, 
we have s" ^ t' . Again, by bisimilarity, there exists an state t" in M', such that 
s" ^^ t" and tRt" . The new state t" is different from other states of M' , since M' 
has no loop. In this way, M' is an infinite model, and we derive a contradiction. 

• k = 2. The above argument can be done for fc = 2, by considering the associated action 
model of the program o^ = iJ,X.La{ip\bl^Y.Lh{ip\aX n il)\cLc{0\bY))) (see Figure 17), 
where none of the formulas <p, ip and 6 are logically equivalent. If there is a program 
/3 with at most one use of recursive operation, then the action model in Figure 17 
(which has a 2-nested loop) would be bisimilar to an action model M, that its graph 
has just one loop. This can easily be shown, since none of the states of the action 
model in Figure 18 are bisimilar to each other, so the action model M' cannot be 
finite. 
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So for any arbitrary k, we can construct an action model having one /c-nested loop, where 
none of its nodes are bisiniilar to each other. Then this action model is in kRLP but not 
(k-l)RLP. H 
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